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^ ' ABSTRACT 

O I We present an 850 /jm map and list of candidate sources in a sub-area of the Groth 

Strip observed using SCUBA. The map consists of a long strip of adjoining jiggle- 
maps covering the southwestern 70 arcmin 2 of the original WFPC2 Groth Strip to an 
average 1 a rms noise level of ~ 3.5 mJy. We initially detect 7 candidate sources with 
signal-to-noise ratios (SNRs) between 3.0 and 3.5 a and 4 candidate sources with SNR 
> 3.5. Simulations suggest that on average in a map this size one expects 1.6 false 
positive sources > 3.5 a and 4.5 between 3 and 3.5 c. Flux boosting in maps is a well 
' known effect and we have developed a simple Bayesian prescription for estimating the 

C*"** ' unboostcd flux distribution and used this method to determine the best flux estimates 

. of our sources. This method is easily adapted for any other modest signal-to-noise 

survey in which there is prior knowledge of the source counts. We performed follow-up 
photometry in an attempt to confirm or reject 5 of our source candidates. We failed to 
significantly re-detect 3 of the 5 sources in the noisiest regions of the map, suggesting 
that they are either spurious or have true fluxes close to the noise level. However, 
we did confirm the reality of 2 of the SCUBA sources, although at lower flux levels 
' than suggested in the map. Not surprisingly, we find that the photometry results are 

consistent with and confirm the de-boosted map fluxes. Our final candidate source 
q ■ list contains 3 sources, including the 2 confirmed detections and 1 further candidate 

!— i ' source with SNR > 3.5 a which has a reasonable chance of being real. We performed 

"o^ \ correlations and found evidence of positive flux at the positions of XMM-Newton X- 

■ ray sources. The 95 per cent lower limit for the average flux density of these X-ray 

sources is 0.8 mJy. 

Jv>( , Key words: submillimetre - surveys - cosmology: observations - galaxies: high- 

■ redshift - galaxies: starburst - methods: statistical 

| 1 INTRODUCTION Bolomet er array) counterparts to many of the se SCUBA 

„„ TT „ sources IIGreve et al . 2004). Mort ier et al.l 112004) also report 
Large blank-field SCUBA surveys have revolutionized our . . r r , A T n ■ n 

, , , not recovering a similar fraction of fow SNR sources m the 

understanding of the importance and diverse nature of dusty o t r. ■ i i • i -,i mr a T^nn 

... , , ., , ITTr~. n u^,J IT, Tl 8-mJy Survey region when combined with newer SHADES 

galaxi es at high redshif t s (e.g. plain et al j| 2002| , | Scott et alj fsriTTR A H A If TW«> FVfr^Wti, S,,™,,! h, thn 



,Y ft L I L^rXT; i T (SCUBA HAlf Degree Extragalactic Survey) data in the 

2002;. IWebb et alj |2003. Bor vs et alJ l2003). Only about 300 v ,,, n , & . 6 , „ //, „ „. , , 

TT — , _ „„,„' , , , , same held. Onfy one region, namefy the Hubble Deep f^iefd 

blank-held SCUBA galaxies have been discovered since the / T ttm^\ • ^^-.^t^o at ^ n i j t i • ^- ± i 

. . . , (HDF) region or GOODS-North held, has been investigated 

instrument was commissioned, m contrast to the tens of . , , , „ . , rr — 71 

. , . ' . , independently by different groups. Reassuringly, Borvs ct al. 

thousands to millions of objects detected m optical surveys J2003ll2004 ) and lWang. Cowie fc BargeH (2004) are in close 
of similar sizes. Nevertheless, the number counts have been 
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, , , , , , agreement for the higher SNR sources m that region. How- 

wefl-charac tensed, and progress is being made in identity- ., .. ,. ., ... , 

, ? , ever, there is some disagreement regarding the reality and 

mg SCUBA galaxies with objects m other wavebands using ,, „ , ... c , r , , . . mi r 

° , , the nux densities of several of the noisier sources, the et- 

source positions derived from radio identifications. rj rn , ,. , . . , , T , . . , / 

TT . — . . tects of ffux boosting (sometimes called Malmquist and/or 

However, the fact remains that SCUBA sources are dit- ,-, , ,. x , r att-> h i u i 

bddmgton bias) are wefl known tor SNR-threshofded maps, 

ncult to find, and when they are detected they typically have , ., . ,, , ., . . ,. ... ... 

/ and it is worthwhile to investigate whether such discrepan- 

low signal-to-noise ratios (SNRs), bringing into question the . , 1111 

° _ _ . _ , cies are to be expected, and when one should be confident 

reliability of measurements. Using deep radio imaging of the , . ... r . , , . r , .. , 

J I • f\ — —-5 ° ° about the reality of a source, independent of whether it has 

8-mJy Survey fields. Ivison ct al. (2002) have suggested that ,. . , ,. T , . , . c , , . n 

J J ' ™ °° a radio identification. It is clear that a careful, un- biased 

low SNR sources in relatively noisy regions of submillime- , . ,. , r . , , ,. . , nl 

J jo anafysis of the robustness of SCUBA detections m shaflow 

tre maps which lack radio counterparts are often spurious. . „ , . . „ . . . , r , 

maps is called tor. fn this vein we provide a caretuf estimate 

Additional evidence that these sources might be spurious rau^- j^ruj - j 

„ . „ of ffux boosting and have followed up m photometry mode 
comes from the lack of MAMBO (Max Planck Millimeter 
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5 sources detected in a shallow 850 fj,m map of the Groth 
Strip, in an attempt to quantify the amount of flux boost- 
ing present in the map. To interpret the results we have 
developed a general method to assess the reliability of low 
SNR sources. 

We apply these techniques to a particular SCUBA sur- 
vey in the 'Groth Strip'. The 'Groth Strip Survey' (GSS) is 
a Hubble Space Telescope (HST) programme (GTO 5090, 
PI: Groth) consisting of 28 overlapping HST Wide Field 
Planetary Camera 2 (WFPC2) medium-deep images, cov- 
ering an area of 113arcmin 2 , forming a long strip centred 
on RA=14 h 16 m 38 , !8, Dec=52°16'52" (J2000), at a Galac- 
tic latitude of b ~ 60°. The GSS was the deepest HST 
cosmological integration before the HDF, reaching a lim- 
iting Vega magnitude of ~ 27.5-28 in both the V and I 
bands jGroth et al.ll 994). The GSS has an enormous legacy 
value, since extensive multi-wavelength observations centred 
on this region have been conducted or are planned. Mor- 
phological and photometric information from the WFPC2 
images ar e provided by the Medium Deep Survey (MDS) 
database jRatnatunga. Griffiths fc Ostrandeill999|) and the 
Deep Extragalactic Evolutionary Probe (DEEP 1 ) survey 
iSimard et al] [2002) . X-ray sources have also been iden- 
tified in an 80 ks XMM-Newton observation of the GSS 
jMivaii et aljl2004l) . The GSS is currently part of the on- 
going DEEP2 2 survey and is also targetted to be a major 
component of upcoming large surveys in the UV (using the 
Galaxy Evolution Explorer, GALEX 3 ), in the optical (as 
part of the Canada- France-Hawaii Telescope Legacy Survey, 
CFHTLS 4 ), and in the IR (the Spitzer GTO IRAC Deep 
Survey) . 

In this paper, we present 850 (jm SCUBA observations 
of about 60 per cent of the original WFPC2 coverage of the 
GSS. We have also performed confirmation photometry on 
some of the sources. Our goal is to make the 850 /im map and 
source list available to the community so that it may be cor- 
related against existing and future data sets at other wave- 
lengths. No claim is made that this survey is either the deep- 
est or the most extensive performed using SCUBA. However, 
the observations cover enough integration time that we ex- 
pect a handful of real sources to be detected, and our survey 
represents the best submillimetre data likely to be available 
in this field until the advent of SCUBA-2. 



2 MAP OBSERVATIONS AND DATA 
REDUCTION 

A roughly 70 arcmin 2 portion of the Groth Strip (GSS) 
was observed with a resolution of 14.7 arcsec and 7.5 arc- 
sec at 850 and 450 /J,m, respectively, with the 15-m JCMT 
atop Mauna Kea in Hawaii in January 1999 and January 
2000. The GSS SCUBA map is centred on RA = 14 h 16 m 00 s , 
Dec = 52°10'00" (J2000). 

52 overlapping 64-point jiggle maps of the GSS were ob- 
tained, providing measurements of the continuum at both 



1 See http://deep.ucolick.org 

2 See http://deep.berkeley.edu 

3 See http://www.galcx.caltech.edu 

4 See pittpTTwwwxmt . hawaii . edu / Science / CFHLS 



wave lengths simultaneously with SCUBA llHolland et al] 
1999), which has a field of view of 2.3 arcmin. 

The atmospheric zenith opacity at 225 GHz was mon- 
itored with the Caltech Submillimetre Observatory (CSO) 
tau (tcso) monitor. The rcso ranged from 0.03 to 0.09 in 
January 1999 and from 0.05 to 0.08 in January 2000. The 
weather was generally more stable for the latter set of data. 

The secondary mirror was chopped at a standard fre- 
quency of ~ 8 Hz in azimuth to reduce the effect of rapid 
sky variations. The telescope was also 'nodded' on and off 
the source. A 40 arcsec chop-throw was used at a position 
angle of 54° , almost parallel to the lengthwise orientation of 
the strip. Pointing checks were performed hourly on blazars 
and planets and varied by less than 3 arcsec in azimuth and 
by less than 2 arcsec in elevation. The overlapping jiggle 
maps were co-added to produce a final map with a total 
integration time of 18 hours and 50 minutes. 

We used SURF (SC UBA User Reduction Facility; 
Ijenness fc Ligh Hbod 1^9981) scripts together with locally de- 
veloped code (iBorvsl 120021) to reduce the data. The SURF 
map and our map look similar. The benefit of using our 
own code to analyse the data is that it makes a map with 
minimally correlated pixels and provides an estimate of the 
noise in each pixel. We chose 3 arcsec pixels oriented along 
RA,Dec coordinates. This pixel size is slightly too large for 
450 Atm studies, but h as proven to be adequate at 850 fim 
(see iBorvs et alJl2003l) . 

2.1 Flux Calibration 

Calibration data were reduced in the same way as the GSS 
data. The flux conversion factors (FCFs) over 3 of the 4 
nights in January 1999 and all 3 nights in January 2000 
agree with the monthly averages to within 10 per cent (see 
the JCMT calibration web-page). The FCF value for one 
night in January was 30 per cent higher than the monthly 
average and this could indicate that the sky was so variable 
that the tcso was not accurately reflecting the opacity along 
the line of sight to the object. The calibration uncertainty is 
omitted from our quoted error values since it is not a major 
contributor to the global uncertainty of our low SNR data 
and has no effect on our source detection method. 

The 850 fim map has a mean consistent with zero, as 
expected from differential measurements, and an rms of 
3.5 mjy. The final map is shown in Fig. The 450 fim map 
also has a mean consistent with zero and an rms of 50mJy. 

2.2 Source Detection Method 

Given the 14.7 arcsec beam, a high-redshift galaxy will be 
unresolved and will appear as a positive source flanked by 
2 negative sources. The source density at 850 urn (se e e.g. 
IScott et al.ll2002l IBorvs et al-lliool IWebb et al]l2003t) sug- 
gests that only a handful of sources will be recovered in our 
map. Hence we do not expect many overlapping sources, and 
therefore sources were extracted by fitting the raw rebinned 
map with a three-lobed PSF of an isolated point-source with 
the same chop throw and position angle as the map data. 
A fit to the PSF model centred on each pixel is equivalent 
to a noise-weighted convolution and is the minimum vari- 
ance estimator if the background consists of white noise. 
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Figure 1. The 850 /im SNR image of the Groth Strip, smoothed with the 3-beam template (see § 12.21 . The solid 40-arcsec diameter 
circles correspond to candidate sources with SNR > 3.5. Dashed circles indicate the positions of the SNR = 3.0—3.5 candidate sources. 
The inlay in the top right-hand corner illustrates our field geometry relative to some other surveys in this region, including the original 
WFPC pointings (jagged squares), the CUDSS+14 field, CFRS and ISO survey regions (trio of large squares, listed here in order of 
increasing size), VLA coverage (large circle) and XMM coverage (smaller circle). 



An accompanying weighted noise map is created simultane- 
ously and provides an estimate of the noise associated with 
the detection of a point source in each pixel. A peak in the 
PSF-convolved map which is 3 times the noise in that pixel 
constitutes a 3 a detected point source. 



3 SOURCE ROBUSTNESS 

For a list of sources found above a given significance level in 
a map to be useful, one must address the following questions: 
'Are there any statistical anomalies in the data which would 
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cause one to doubt any of the sources?'; 'Given our actual 
noise and measurement strategy, what fraction of sources 
present at any given flux level would we expect to detect?', 
that is, 'How complete is our source list?'; and finally, 'Do 
the fluxes inferred from the maps form a biased estimator 
of actual source flux?' Bias and flux boosting are addressed 
m §IH Quality of fit and completeness are addressed here. 

We investigate source robustness using several tech- 
niques, including spatial and temporal \ 2 tests, searching 
for negative sources, and Monte Carlo simulations. 



3.1 Spatial and Temporal \ 2 Tests 

Although a candidate source may be 'detected' in the map, 
it may not necessarily be well-fit by the PSF, or it may be a 
poor fit to the set of difference data, or both. We have per- 
formed spatial and temporal \ 2 tests in order to determine 
how well the raw timestrea m data fit the final PSF-fitted 
maps. See lPope et alJ i2004) for details. 

The spatial \ 2 test provides a gauge across the map 
of the goodness-of-fit of the triple-beam differential PSF to 
the data, and thus indicates if a source is poorly fit by the 
assumed PSF. We find that the \ 2 values for each pixel 
where a source is detected are within ±2cr (=2^/2_/V p i x ), 
where iV p i x is the of number degrees of freedom or number 
of pixels included in the fit) in all cases, except for GSS850.3, 
a candidate source with SNR=3.3. The poor fit may be the 
result of its proximity to GSS850.4 (see Fig. 0. 

The temporal \ 2 provides a measure of the self- 
consistency of the raw timestream data which contribute 
flux to each map pixel. For example, a portion of the hits 
on a pixel might be consistent with a certain flux value, 
whereas the rest of the hits might be most con sistent with 
a diffe rent value. Following the prescription of IPooe et alJ 
(2004), we calculate the pixel temporal Xi an d the number 
of hits for each pixel N^- lts . We then essentially construct a 
SNR map of poorness-of-fit to the model by using the quan- 
tity Xi ~ -^hits as the 'signal' and y/ 2iVjj its as the 'noise' 
and fitting the PSF to this temporal \ 2 map. We find that 
none of the pixels at the centres of our candidate sources lie 
outside the ±2<j regions of our distribution. We are there- 
fore confident that all of our candidate sources lie in regions 
of the map with self-consistent timestream data and have 
no grounds to reject any of our detected candidate sources 
based on these tests. 

Note that these two tests also check for Gaussianity of 
the noise in the map, but they are not a strong test of this 
distribution. 



3.2 Sources Detected in the Inverted Map 

A quick test of source reality is to create the negative of the 
map and to search for sources using the same triple-beam 
template. Aside from pixels associated with the off-beams 
of positive detections, we find 6 'detections' in the inverted 
map, consistent with the expected number of false positive 
detections in noisy data (see § 13.31 . 
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Figure 2. Cumulative number of detected candidate sources. We 
plot results against SNR, threshold for our map (dark histogram), 
the average expected in source-free simulated 850 (im maps con- 
taining only Gaussian random noise (light dotted curve) , the num- 
ber of false positives expected on average plus the predicted num- 
ber of real sources using the source counts of lBorvs et al.l j200.'Hfi 
multiplied by the completeness estimate (dark solid curve), and 
the confirmed spurious detections (light histogram, see §|1J. The 
number of expected detections and the number of actual detec- 
tions are consistent within the Poisson noise. 



3.3 Monte Carlo Simulations 

Simulations are required in order to evaluate map complete- 
ness and the likely rate of false positive detections, given our 
non-uniform noise . We follow the procedures described in 
iBorvs et alJ l|2003h for investigating the number of positive 
sources expected at random, as well as the map complete- 
ness. 

In order to determine how many detections may be spu- 
rious, we created a map with the same shape and size as the 
real one, but replaced the 850 /im data with Gaussian ran- 
dom noise, generated using the rms of the timestream of 
each file and each bolometer. We then performed the same 
source detection procedure that was used on the real data. 
We repeated this sequence of steps 1000 times and plot the 
cumulative number of positive sources detected on average in 
Fig-HJat each SNR threshold. The simulations suggest that 
on average one expects 1.6 false positive sources > 3.5 a and 
a further 4.5 between 3 and 3.5 a. 

The completeness of a map is the fraction of sources 
which one expects to detect at each flux level. To mea- 
sure this fraction, we added a source of known flux into the 
real map and tried to extract it using our source extraction 
method. We selected the input source flux randomly in the 
range 3-20 mjy, located it uniformly across the map, and 
repeated this procedure 1000 times. A source is considered 
recovered if it is detected with SNR > 3 and located within 
7.5 arcsec (the 850 pm beam HWHM) of the input position. 
We estimate that about 60 per cent of the > 10 mjy sources 
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Figure 3. Completeness of 850 fim source recovery at each level 
of input flux for 3 a detections, as determined from the Monte 
Carlo simulations of individual sources added to the CSS data 
described in § 13.31 



Table 1. CSS 850 fim candidate submillimetre sources with follow- 
up photometry. Locations of these and the other 6 candidate SCUBA 
sources are indicated in FiglTl 



Object 



5 8 so a 
(mjy) 



^850 
(mjy) 



GSS850.1 
GSS850.2 
GSS850.6 
GSS850.7 



9.5 ±2.9 (3.3o-) 
8.2 ±2.3 (3.6 a) 
12.3 ±4.1 (3.0 a) 
13.2 ±3.2 (4.1 a) 



GSS850.11 11.2 ±2.9 (3.8 a) 



-1.8 ±3.6 (-0.5 cr) 
5.7 ±1.1 (5.2 a) 
1.2 ±3.1 (0.4 a) 
4.2 ± 1.7(2.5<t) 
0.1 ± 1.6(0.1(7) 



Flux density estimate from the map. 

Flux density measured from follow-up photometry. 



are detected above a SNR of 3cr in the map (see Fig. 0. 
The completeness is slightly higher than it would be in a 
map with uniform noise at the same rms, as expected. 



4 CANDIDATE SUBMILLIMETRE SOURCES 

We detect 4 candidate sources with SNR > 3.5 a and 7 can- 
didate sources with SNRs in the range 3.0-3.5 a. A submil- 
limetre image of the GSS is shown in Fig.0 where we num- 
ber each of these 11 candidates, while in Table^we present 
information on the 5 sources for which we performed follow- 
up photometry (see S I4.H . The signal and noise maps may 
be downloaded from http://cmbr.physics.ubc.ca/groth 



4.1 Additional Photometry 

In December 2003 and January 2004, SCUBA photometry 
observations were performed in the 2-bolometer chopping 
mode to check some of our candidate map detections. We 
selected 2 candidates, GSS850.7 and GSS850.11, near the 
noisier edge regions of the map and a 'control', GSS850.2, 
in a lower-noise region away from the edges. Also, during 
one of the observing runs in January 2000, four sources were 
identified in the map data 'by eye' and selected as targets 
for follow-up photometry. Only two of these pointings cor- 
respond to candidate detections in the final map (GSS850.1 
and GSS850.6); this is a warning that our eyes often pick out 
bright outliers in noisy regions of a map. All of the photom- 
etry observations were reduced in the standard way using 
SURF. In order to increase the SNR of sources observed in 
the 2-bolometer mode by a factor of approximately ^/3/2, 
we folded in the signal fro m the off-position bolo meters to 
the central bolometer (see IChapman et alJ 120001) . The re- 
sults are listed in Table [T] for comparison with the esti- 
mated map fluxes. In all cases, the photometry pointings 
were within 3 arcsec of the positions found by the source- 
detection algorithm in the map. 



4.2 Flux Boosting in the Map 

It appears that we have detected some sources in the map. 
However none of the candidate sources have very high SNRs, 
and so we need to be careful in interpreting these results. 
Confusion can either increase or decrease the input flux of 
a source, but a noisy flux-limited map will preferentially 
contain sources whose true fluxes have been increased (usu- 
ally called Malmquist bias) and this effect is exacerbated for 
steep source counts. Our source extraction procedure there- 
fore biases fluxes in the map upwards, which we now attempt 
to quantify. 

We performed a set of simulations to assess the expected 
distribution of pixel brightnesses from triple-beam (i.e. dou- 
ble difference) observations of a noiseless blank-sky. We used 
a smooth curve fit to (and mildly extrapolated from) the 
number counts of lBorvs et alJ ll2003l) as an a priori distribu- 
tion of fluxes in the range 0.1-40 mjy. We then populated 
3 different patches of noiseless sky following a Poisson dis- 
tribution, and sampled each area with a SCUBA Gaussian 
beam, taking a double difference each time. This was done 1 
million times and the anticipated prior distribution of dou- 
ble difference flux measurements, N(S P ), is plotted in the 
first panel of Fig. 0] We have also investigated the effects of 
reasonable excursions from the assumed shape of the source 
counts. For example, using the 1 a error bar values at the 
bright end of the number counts has less than a 10 per cent 
effect on the resulting flux estimates. 

The distribution N(S P ), which is the prior probability 
that a pixel in the map has differential flux S p , is calculated 
from noiseless simulations while our actual map contains 
noise. If M is the statement that we measure flux Sm ± cr m 
at some pixel in the map, the probability that the true flux 
of that pixel is S p is obtained from Bayes' theorem: 



P(S P ) = P(S P \M,N(S P )) = 



N(S P ) x P(M\S P ) 
P{M) 



(1) 
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Figure 4. The fi rst panel shows the histogram of the 1 million noiseless triple-beam simulations described in § 14.21 This is the P(D) 
distribution (e.g. IScheueilll974l ICondonlll974l) for a triple-beam experiment, which is strongly peaked around zero and skew positive 
due to sources. The other panels show the resulting flux probability distributions (dark histograms) for the 5 map source candidates 
which had follow-up photometry observations, given their measured fluxes and errors (assumed to be Gaussian distributions, plotted 
separately as light dot-dashed lines) and the underlying source count model. Photometry measurements are overplotted for comparison 
and are also assumed to have Gaussian probability distributions (dark dashed lines). A vertical line at 5 P =0 is plotted as a reference with 
which to compare the photometry measurements. One can see that the combination of the intrinsic distribution of fluxes and the low 
SNR measurements from the map make the photometry measurements much less inconsistent than they might appear (i.e. the dashed 
Gaussians compare well with the solid histograms, even although the two Gaussians in each panel are usually discrepant). 



where the probability we would IT16cLSlir6 5*111 when the true 
flux is S p is 

(Sm-Sp) 2 

P{M\S p )=Ae 2 "S. , (2) 

under the assumption that our noise is Gaussian distributed. 
We have weakly tested this assumption in § 13.11 and 13.21 
P(M) acts as an overall normalisation in equation Q, and 
does not depend upon S p as long as the noise is not corre- 
lated with sources on the sky. Strictly speaking, P(M\S P ) 
should be altered from the form we have used to account for 
the fact that we are examining the probability at a location 
where we have found a peak. In practice, at S m — S p > 3er 
the f ull expression converges t o the simpler form we have 
used llBond fc Efstathioulll987l) ■ 

The posterior flux probability distribution, P(S P ) is 
shown as a solid histogram in the panels of Fig. 2] for each 
of the 5 sources for which we also have follow-up photom- 
etry information. The dot-dashed Gaussian in each panel 
is P(M\S P ), which is often incorrectly adopted as the flux 
estimate of a map source. 

In Fig. 0|we have placed the individual P(S P ) plots in 
order of increasing map SNR. It is clear that one expects 
to measure a non-zero flux value a significant fraction of 
the time only for sources with relatively high SNRs. Sources 
with modest SNRs are much more likely to have non-zero 
photometry results than lower SNR sources. The peak in 
the a posteriori distribution at zero flux dominates for SNR 



< 3.5. This confirms the usual prejudice towards high SNR 
sources - if a source is bright, it needs to be detected with 
a SNR > 4<r in order to be deemed a secure detection. 
Moreover, we also find that at the same SNR level, appar- 
ently brighter sources (with consequently higher flux uncer- 
tainty) are more likely to be spurious (i.e. flux boosted from 
~ mJy) than fainter sources. Thus at a given SNR, low flux 
sources are more likely to be real than high flux sources. 

We performed independent photometry observations on 
5 sources with SNRs ranging from 3.0-4.1 a and the results 
are shown in Tabled We wish to know if the photometry 
measurements are consistent with the map-detected fluxes. 
In other words, we want to answer the question: 'What is 
the probability that we will measure S p in photometry mode 
given the map-detected flux (Sm) and uncertainty (cr m ) and 
the underlying source count model?'. The main point is that 
the a posteriori probability of finding a bright source will be 
down-weighted by the a priori probability coming from the 
source counts. 

A comparison of the dashed curves and solid histograms 
in Fig. [I] shows that our photometry results are consistent 
with P(S P ), even though the photometry is often inconsis- 
tent with the raw (i.e. flux boosted) map readings P(A'I\S P ). 

4.3 A Revised Source List 

We can now assess the probability of obtaining each of the 
photometry measurements using the distributions in Fig. 0] 
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For GSS850.6, a photometry result lower than what we mea- 
sured is expected 43 per cent of the time. For the remaining 
sources GSS850.1, GSS850.2, GSS850.11 and GSS850.7, we 
have assessed that a photometry measurement lower than 
the one we obtained would have occurred 12, 60, 6, and 18 
per cent of the time, respectively. A Kolmogorov-Smirnov 
(KS) test performed on these results determined that the 
set of 5 trials is consistent with a uniform distribution. 

The photometry results are thus completely within the 
realm of what is expected, despite the apparently contra- 
dictory results presented in Tabled We confirm GSS850.2 
and GSS850.7 as bona fide 850 (jm sources, and we confi- 
dently eliminate GSS850.1, GSS850.6, and GSS850.11 from 
our candidate source list. Note that these eliminated sources 
are also among the 4 noisiest candidate detections in the 
map, which makes it even less surprising that they are spu- 
rious. We have tallied up the number of detections, expected 
sources, and spurious detections and have illus trated these 
in Fig. HI We note that t he number counts (e.g. IScott et alJ 
l200l iBorvs et alJl200l IWebb et alJl2003Fl predict the de- 
tection of around 3 sources at the flux limit of the map 
(~ lOmJy). 

Our revised source list now includes 2 confirmed sources 
(with coordinates given in Table |2J as well as 1 other candi- 
date which is > 3.5 a in the map. Although not confirmed by 
photometry, Figs. [5] and 2] suggest that SNR > 3.5 sources 
have a reasonable chance of being real. We are able to cal- 
culate a best estimate of the flux for each of these objects 
using the combination of all available information, including 
the map measurements, photometry measurements and the 
source count prior. To do this we multiply the measured (as- 
sumed Gaussian) photometry flux probability distribution, 
P(S p ,a p ), by the calculated posterior probability for the 
map flux (equation Q), which we take as the prior distri- 
bution for S p , and we normalise it to have unit integral. For 
the candidate source, GSS850.4, we only know the a poste- 
riori distribution for the map measurement, since we do not 
have a photometry measurement for this object. In Table [5] 
we give the peak of these new distributions, along with the 
error bars describing the 68 per cent confidence regions. 



5 A FIRST ATTEMPT AT 

MULTI- WAVELENGTH CORRELATIONS 

We now use our new candidate source list (see Table |SJ to 
search for close counterparts at other wavelengths in other 
data sets which overlap with our coverage. We also perform 
stacking analyses to see if there is any overlap between the 
catalogues and maps. 

The 450 /im map of this region is of poor quality; the 
data are shallow (since the sensitivity at 450 /im is worse) 
and inhomogeneous (being more prone to changes in the 
weather) . We do not detect any of our 850 /im sources in 
the 450 /xm map, but we present 95 per cent confidence up- 
per limits to the 450 fim flux for each 850 fim detection in 
Table |21 The 450 /im average (or 'stacked') flux density at 
the 3 850 /im-detected positions is 10 (±23) mjy. 

Using an 80 ksec XMM-Newton ob servation encompass- 
ing the northeast part of the GSS, iMivaii et alJ (l2004h 
have uncovered about 150 sources down to flux limits of 
~ 1 X 10" 20 and ~ 2 x 10" 20 Wm -2 in the soft (0.5-2 keV) 



and hard (2-10 keV) X-ray bands, respectively. Of these de- 
tections, 7 lie within our submillimetre map and the X-ray 
positional errors are typically about 2-3 arcsec. No X-ray 
counterparts exist within the anticipated error circle of 4", 
and indeed even there are no counterparts within a full beam 
of any SCUBA source. However, the stacked 850 (j,m flux 
from the 7 X-ray positions lying within the submillimetre 
map region is 2.5 (±1.1) mjy. This corresponds to a 0.8 mjy 
95 per cent confidence lower limit to the mean flux of these 
sources. 

These X-ray sources are ther efore brighter than Lym an- 
break galaxies at 850 (e.g. IChapman et alJ |2000T) ! If 
AGNs do not comprise a large fraction of our sources, this 
result indicates that the X-ray emission originates from pro- 
cesses related to star-formation. This result illustrates that 
this map can, in fact, be used to make statistical remarks 
about ~ 1 mjy sources even though individual detections 
are hopeless, and shows a path t o populating the co nfusion 
sea in the submillimetre (see also lBorvs et al.|l2004l) . 



6 CONCLUSIONS 

We have mapped approximately 70 arcmin 2 of the Groth 
Strip at 850 /im with SCUBA on the JCMT to a 1 a depth 
of around 3.5 mjy. 

Using a robust source detection algorithm, we have 
found 11 candidate sources with SNR > 3<r. Monte Carlo 
simulations suggest that most of these will either be spuri- 
ous or considerably flux boosted. Follow-up photometry ob- 
servations have confirmed 2 of them and rejected 3. Based 
on these follow-up photometry data, we have determined, 
not surprisingly, that candidate sources in high-noise regions 
of the map have implausibly high apparent fluxes at SNR 
> 3<t, and are likely to be spurious false-positive detections. 
We reiterate that bright sources detected in a map should 
have SNR > 3.5 a before they have a reasonable chance of 
being real, and SNR > 4<r before they should be believed 
with any confidence. Our final source list for the GSS con- 
tains 2 confirmed SCUBA sources and 1 further candidate 
source with SNR > 3.5 a. Using a combination of the un- 
boosted map flux posterior probability distributions and the 
photometry measurements (when available) , we present best 
estimates of the flux for these objects. 

We have measured a mild statistical detection of 
low flux (~ 1 mjy) sources at X-ray wavelengths through 
a stacking analysis, and it may be that similar com- 
parisons with data at other wavebands might also be 
fruitful. We have therefore made our maps available at 
http : / / cmbr .physics .ubc . ca/groth 

Our simple Bayesian method for correcting the effects 
of flux boosting should be useful for future surveys such as 
SHADES and those carried out with SCUBA-2, as well as 
for other instruments which provide data in the low SNR 
near-confusion regime. It may also be useful to adapt this 
method in order to find sources, by searching for pixels in a 
map for which the posterior probability for S p > is above 
some threshold. 



© 0000 RAS, MNRAS 000, 000-000 



8 Coppin et al. 



Table 2. GSS 850 fim revised source list. We have included the 2 photometry-confirmed sources and an additional > 3.5 candidate source 
which has a reasonable chance of being real. The best 850 (im flux estimate is given based on the combination of the posterior probability 
of the map flux given the data together with the photometry flux for the sources with photometry (2 and 7), and based solely on the 
posterior probability of the map flux for GSS850.4. The reported flux is the most likely flux in the 68 per cent confidence region, with 
upper and lower error bars shown to indicate the range of that confidence interval. 95 per cent Bayesian upper limits are also given for 
the 450 (im flux of each 850 (im detection. 



Object Position (2000.0) S s50 S450 

RA Dec (mjy) (mjy) 

GSS850.2 14 h 15 m 25?0 +52°02'57" 5.9±5;£ < 170 

GSS850.4 14 h 15 m 29?5 +52°04'48" 6.9^4 ° < 48 

GSS850.7 14 h 16 m ll!6 +52°13'42" 4.8j;H < 118 
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